Measuring Interlanguage: Native Language Identification with L1-influence Metrics

نویسندگان

  • Julian Brooke
  • Graeme Hirst
چکیده

The task of native language (L1) identification suffers from a relative paucity of useful training corpora, and standard within-corpus evaluation is often problematic due to topic bias. In this paper, we introduce a method for L1 identification in second language (L2) texts that relies only on much more plentiful L1 data, rather than the L2 texts that are traditionally used for training. In particular, we do word-by-word translation of large L1 blog corpora to create a mapping to L2 forms that are a possible result of language transfer, and then use that information for unsupervised classification. We show this method is effective in several different learner corpora, with bigram features being particularly useful.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-linguistic Influence in Third Language Acquisition of Voice Onset Time

The paper aims to investigate the sources of crosslinguistic influence in the acquisition of voice onset time patterns in third language phonology. Thirty two learners of L3 French with L1 Polish and L2 English were recorded reading lists of words in carrier phrases in the three respective languages. The recordings were analyzed for the degree of aspiration of voiceless stops in stressed onset ...

متن کامل

Locative Constructions in English Interlanguage: A Study of the Acquisition of Argument Realization in English by Iranian EFL Learners

Locative constructions, as an instance of double object constructions with various cognitive-linguistic concepts, have been successful in attracting the linguists. This paper tried to evaluate the acquisition of English locative constructions by Persian native speakers in the absence of negative evidence, focusing on the influence of L1 and participants’ proficiency level in the course of acqui...

متن کامل

Cross-linguistic Influence in Third Language Acquisition: Factors Influencing Interlanguage Transfer

In the field of Second Language Acquisition (SLA), the role of transfer has been theorized and studied to account for the influence of the native language on an interlanguage. However, stemming from behaviorist mentality, the concept of transfer has become widely associated with the direct transfer of habits and fails to take into consideration other cross-linguistic influences (CLI) that shape...

متن کامل

The Interlanguage of Persian Learners of Italian: a Focus on Complex Predicates

This paper aims at investigating the acquisition of Italian complex predicates by native speakers of Persian. Complex predication is not as pervasive a phenomenon in Italian as it is in Persian. Yet Italian native speakers use complex predicates productively; spontaneous data show that Persian learners of Italian seem to be perfectly aware of Italian complex predicates and use this familiar fea...

متن کامل

Native Language Identification Based on English Accent

Present work is aimed at investigating the influence of mother tongue (L1) of a South Indian speaker on a second language (L2). Second language can be a dominant local language, national language in India i.e., Hindi or a connecting language English. In the current study, L2 is a short discourse in English. Cepstral and prosodic features were used as in Language Identification (LID) to distingu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012